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Abstract — This paper introduces a class of structured low- 
density parity-check (LDPC) codes whose parity check matrices 
are arrays of permutation matrices. The permutation matrices 
are obtained from Latin squares and form a finite field under 
some matrix operations. They are chosen so that the Tanner 
graphs do not contain subgraphs harmful to iterative decoding 
algorithms. The construction of column-weight-three codes is 
presented. Although the codes are optimized for the Gallager 
A/B algorithm over the binary symmetric channel (BSC), their 
error performance is very good on the additive white Gaussian 
noise channel (AWGNC) as well. 

I. Introduction 

It is now well established that the error floor phenomenon, 
an abrupt degradation in the error performance of low-density 
parity-check (LDPC) codes JT] in the high signal-to-noise-ratio 
(SNR) region, is due to the presence of certain structures in 
the Tanner graph that lead to decoder failures. For iterative 
decoding, these structures are known as trapping sets. 

Ideally, LDPC codes should be designed so that their Tanner 
graphs do not contain most harmful trapping sets, but unfortu- 
nately, except for the binary erasure channel, trapping sets for 
other channels such as the BSC or the AWGNC, are only 
partially understood. Consequently, many existing methods 
of constructing LDPC codes only attempt to maximize the 
girth of the Tanner graphs pl-Q or avoid subgraphs that 
are believed to be harmful |5J. In the latter approach, the 
subgraphs are identified either by computer simulation or 
hardware emulation, or are conveniently defined to make the 
search easier. The problem in these approaches lies in the 
underlying assumption about harmfulness, which is not proven 
or is restricted to specific cases. 

In this paper, LDPC codes are constructed so that their 
Tanner graphs do not contain trapping sets of the Gallager 
A/B algorithm on the BSC. The code construction utilizes 
the Trapping Set Ontology (TSO) given by Vasic et. al. in 
Q. This database contains trapping sets for the Gallager A/B 
algorithms which are organized based on their topological 
relations. Our approach relies greatly on the relative harm- 
fulnes^of different trapping sets to determine which trapping 

1 The relative harmfulness of a trapping set given in the TSO for the Gallager 
A/B algorithm over the BSC is determined by its critical number and strength 
jTJ. The relative harmfulness of a trapping set given in the TSO for the 
sum product algorithm over the BSC is currently being studied and will be 
discussed in the journal version of this paper. 



sets should not be present in the Tanner graphs of codes. The 
choice of which trapping sets to avoid is critical to the error 
performance and the code rate. 

Although the codes are optimized for the Gallager A/B 
algorithm over the BSC, experimental results indicate that 
their error performance under other iterative algorithms such 
as the sum product algorithm (SPA) are also extremely good. 
The explanation for this is based on the observation by 
Chilappagari et. al. in [8 1 that the decoding failures for various 
decoding algorithms and channels are closely related and 
subgraphs responsible for these failures share some common 
underlying topological structures. These structures are either 
trapping sets for iterative decoding algorithms on the BSC or 
larger subgraphs containing these trapping sets. 

The above approach can be incorporated into many existing 
classes of LDPC codes to result in codes with good error 
performance (e.g., (9), JTO)), including the new class of 
structured LDPC codes that we propose in this paper. 

To be efficiently encodable and decodable, LDPC codes 
must be structured. An important class of structured LDPC 
codes are quasi-cyclic (QC) codes. In the past few years, 
numerous QC constructions have been proposed. They can be 
broadly classified as algebraic [9|, fTT^-fPJ) and combinatorial 
fl4| , fl5) . In this paper, we give a class of structured LDPC 
codes whose parity check matrices are arrays of permutation 
matrices. Our design is motivated by the work of Lan et. al. 
j9). In (9), the authors give a general algebraic construction 
of QC-LDPC codes based on a one-to-one correspondence 
between an element of the multiplicative group of GF(gr) and 
a circulant permutation matrix of size (q — 1) x (q — 1). In 
our construction, the set of permutation matrices together with 
some matrix operations (introduced later in this paper) form 
a field isomorphic to GF(q). Our permutation matrices are 
similar to circulants in the sense that the set of q — 1 non- 
identity permutation matrices form a cyclic group, but they are 
more general as the circulant property holds on indices under- 
stood as elements of GF(<7). More specifically, the permutation 
corresponding to a 1 sends the indices (0, 1, a, . . . , a q ~ 2 ) to 
(0 + a 1 , 1 + a 1 , a + a*, . . . , a q ~ 2 + a 1 ). 

The construction allows for a systematic reduction of error 
floors. We present a construction algorithm which recursively 
builds the parity check matrix by adding permutation matrices. 
The algorithm ensures that after each step, the corresponding 



Tanner graph does not contain certain trapping sets defined in 
the TSO. 

The rest of the paper is organized as follows. In Section 
III] we provide the background related to LDPC codes and 
the necessary preliminaries for the construction method. In 
Section 



III 



we give the general definition of the class of 
structured LDPC codes from permutation matrices. In Section 
|IV| we present the construction of codes based on Latin 
squares obtained from the additive group of a Galois field. In 
Section [V] we describe the construction algorithm and present 
the construction of several column-weight-three codes. Finally, 



we conclude the paper in Section VI 



II. Preliminaries 

In this section, we introduce the definitions and notation 
used throughout the paper. 

A. LDPC Codes and Trapping Sets 

Let C denote an (n, k) LDPC code over the binary field 
GF(2). C is defined by the null space of H, an m x n parity 
check matrix of C. H is the bi-adjacency matrix of G, a Tanner 
graph representation of C. G is a bipartite graph with two sets 
of nodes: n variable (bit) nodes V = {1,2, ...,n} and m 
check (constraint) nodes C = {1,2, ... ,m}. The length of 
the shortest cycle in the Tanner graph G is called the girth g 
ofG. 

A trapping set for an iterative decoding algorithm is defined 
as a non-empty set of variable nodes in G that are not 
eventually corrected by the decoder [16|. A trapping set T 
is called an (a, 6) trapping set if it contains a variable nodes 
and the subgraph induced by these variable nodes has b odd 
degree check nodes. 

B. Permutation Matrices from Latin Squares 

A permutation matrix is a square binary matrix that has 
exactly one entry 1 in each row and each column and O's 
elsewhere. Our construction makes use of permutation matri- 
ces that do not have l's in common positions. These sets of 
permutation matrices can be obtained conveniently from Latin 
squares. 

A Latin square of size q (or order q) is a q x q array in 
which each cell contains a single symbol from a q-set S, such 
that each symbol occurs exactly once in each row and exactly 
once in each column. A Latin square of size q is equivalent 
to the Cay ley table of a quasigroup Q on q elements (see [17, 
pp. 135-152] for details). 

For mathematical convenience, we use elements of Q to 
index the rows and columns of Latin squares and permutation 
matrices. Let C = [h,j]ij e Q denote the Latin square defined 
on the Cayley table of a quasigroup (Q, ©) of order q. Define 
/, an injective map from Q to Mat(g, q, GF(2)), the set of 
matrices of size q x q over GF(2), as follows: 

f:Q -> Mat(?,g,GF(2)) 
a ^ f(a) = [mi,j] iJgQ 



such that: 

1 if U j = a 
if k.j 7^ a 

It follows from the above definition that the images of 
elements of Q under / give a set of q permutation matrices 
that do not have l's in common positions. 

III. LDPC Codes as Array of Permutation Matrices 

In this section, we give the general definition of LDPC 
codes whose parity check matrices are arrays of permutation 



matrices. Let W 
a quasigroup Q, 



W = 



Jl >3h<i<ix,l<j<T] 



be an /i x r\ matrix over 



w 2 ,i 



»1,2 



W 1,V 

w 2 , v 



(1) 



With some abuse of notation, let H = /(W) = [/(itfj.j)] 
be an array of permutation matrices, obtained by replacing 
elements of W with their images under /, i.e., 



H 



/Ki) 

f( w 2,l) 



f( w l,2) 
f( w l,2) 



f( w i,v) 

f(u>2, V ) 



(2) 



/Kl) /(^M,2) ••• /K^) 

Then H is a binary matrix of size pq x r\q. The null space of 
H gives an LDPC code of length -qq. The column weight and 
row weight of T~L are p and r/, respectively. 

IV. Structured LDPC Codes from Galois Fields of 

Permutation Matrices 
A. Galois Fields of Permutation Matrices 

Consider the Galois field GF(g), where q is a power of a 
prime. Let a be a primitive element of GF{q). The powers 
of a, a~~°° = 0,a° = 1, a, a 2 , . . . , a q ~ 2 , give all the q 
elements of GF{q) and a 9-1 = 1. Let C = [kj] i j £ GF(g) 
denote the Latin square defined by the Cayley table of the 
quasigroup given by the set {0, 1, a, ... , a 9 ~ 2 } together with 
the subtractive operation of GF(q), i.e., l i: j = i — j. Let 
M = {M_ OOJ Mo, Mi, ... , M g _ 2 } be the set of images of el- 
ements of GF(q) under /, i.e., M t = [mfj]. je Qp^ = /(«*)• 
It is easy to see that M-oo = /, the q x q identity matrix. 
To show that A4 forms a field isomorphic to GF(g) under 
the matrix operations defined below, we give the following 
propositions. Due to page limitations, the proofs are omitted. 

Proposition 1: For all t\,t% € Z, /(a* 1 +a' 2 ) = M tl M t2 . 

Proposition 2: For all t > 0, M t+1 = PM t Q, where P € 
M qxq [GF(2)} is given as 







1 



(3) 



and Q is the transpose of P. 

Define the addition EH and the multiplication □ on A4 as: 

M tl fflM t2 = M tl M t2 , 

M tl □ M t2 = p(*="*i) M tl Q [t2 - tl] 

then it can be shown that M. together with ES and □ form a 
field isomorphic to GF(q). 

B. LDPC Codes from Galois Fields of Permutation Matrices 

Define VV and H as in ([TJ and d2), where Q is the set 
{0, 1, a, . . . , a 9 " 2 } together with the subtractive operation 
of GF(q). The following theorem gives the necessary and 
sufficient condition on VV, such that the Tanner graph cor- 
responding to H has girth at least 6. 

Theorem 1 ( Cross-addition Constraint): The Tanner graph 
corresponding to H contains no cycle of length four iff j 1 + 

w i 2 ,j2 u>h,h + w i2,h for any 1 < «i,«2 < M> 1 < 3uh < 
k ^ k,h ^ 32- 

Proof: The proof is omitted due to page limitations. ■ 
It can be seen that the construction of LDPC codes with 
girth at least 6 from a Galois field of permutation matrices 
reduces to the finding of a matrix VV that satisfies the cross- 
addition constraint. One form of VV that satisfies the cross- 
addition constraint is given by 
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Let H = /(VV). From Proposition |5j it follows that % has 
the following structure: 
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(5) 



where M t — P t MoQ t and / is the q x q identity matrix. H 
is an array of permutation matrices from A4 and is a q 2 x q 2 
matrix over GF(q) with both row and column weights q. Since 
W satisfies the cross-addition constraint, the Tanner graph 
corresponding to H contains no cycle of length 4. 

For any pair (7, p) of positive integers with 1 < 7, p < q, 
let H be a 7 x p subarray of W. Then H is a 7(7 x pq matrix 
over GF(2) which is also free of cycles of length 4. H has 
constant column 7 and row weight p. The null space of H 
gives a regular structured LDPC code C of length pq with rate 
at least (p — 7)/ ' p JTl. 

Remarks: 

• The matrix VV in Q is obtained by adding a row and 
a column of all zeros to C, where C is the Latin square 
obtained from the Cayley table of the multiplicative group 
ofGF(g). 



• The codes given in this paper can be alternatively defined 
on integer latices. Since array LDPC codes introduced by 
Fan in fTU) , can be defined on integer lattices as shown 
in fTH) , they are special cases of the codes given in this 
paper. If q is a prime, then the parity check matrices of 
array LDPC codes are subarrays of f(W p ), where W p is 
obtained by permuting rows and columns of VV (in Q). 
The codes by Lan et. al. J5) based on the additive groups 
of prime fields are also array LDPC codes. 

• Our class of codes is also different from codes given 
by Gabidulin et. al in [19| (except for codes based on 
prime fields, for which the latter become array codes). 
In fT9) , permutation matrices of size q x q, where q is a 
power of a prime, are obtained from the Tensor product 
of circulant matrices of size pxp, thus are different from 
the permutation matrices in d5J, 

• If £ is defined by the Cayley table of the multiplicative 
group of GF(q), then circulant permutation matrices of 
size (q — 1) x [q — 1) are obtained as images of elements 
of GF(g)\{0}. In such case, the necessary and sufficient 
condition on VV such that the Tanner graph corresponding 
to H has girth at least 6 is called the cross-multiplication 
constraint. This condition can be obtained from the cross- 
addition constraint by replacing addition with multiplica- 
tion. This gives an alternative description for the codes 
described in |(9). 

V. Construction of Codes Free of Small Trapping 

Sets 

The description of the class of LDPC codes given in 
the previous sections along with Theorem [T] allow us to 
construct codes by progressively building the Tanner graphs. 
The construction is performed by an algorithm which forms 
the matrix VV in ([TJ. The algorithm is based on a check and 
select-or-disregard procedure. Let t specify which graphical 
structures should not be present in the Tanner graph G. For 
example, Figure[TJshows the subgraphs induced by some small 
trapping sets, t may specify the girth of G and may also 
specify the minimum distance of the code. For column-weight- 
three codes, all possible codewords of even weight less than 12 
are known and their induced subgraphs are listed in the TSO. 
It is simple to check the Tanner graph for cycles of length four 
thanks to Theorem [TJ Finding girth of the Tanner graph can be 
done in polynomial time using the Dijkstra or Bellman-Ford 
algorithm, while enumerating cycles of a given length using 
a standard tree-based algorithm has linear complexity in the 
code length |6). An efficient search of the Tanner graph for 
trapping sets relies on the topological relations among them 
and carefully analyzing the induced subgraphs. Details on the 
graph searching techniques will be given in the journal version 
of this paper. 

The Tanner graph of a code is built progressively in p stages, 
where p is the row weight of the parity check matrix. Usually, 
p is not pre-specified, and codes are constructed to have rate 
as high as possible. At each stage, a set of q variable nodes 
are introduced, initially not connected to check nodes of the 






(a) 



(b) 



(c) 



Fig. 1. |(a)| (5, 3) trapping set of girth 6, |(b)| (5, 3) trapping set of girth 8 and 
|(c)| (6, 4) trapping set. We use • to represent variable nodes, ■ to represent 
odd degree check nodes and □ to represent even degree check nodes. 



Tanner graph. Blocks of edges are then added to connect the 
new variable nodes and the check nodes. Each block of edges 
corresponds to a permutation matrix and hence corresponds 
to an element of GF(q). An element of GF(q) may be chosen 
randomly, or it may be chosen in a predetermined order. After 
a block of edges is added, the Tanner graph is checked for 
condition r. If the condition r is violated, then that block 
of edges is removed and replaced by a different block. The 
algorithm proceeds until no block of edges can be added 
without violating condition r. It can be seen that the algorithm 
is a combination of the progressive edge growth algorithm for 
constructing random LDPC codes [ 20 ] and the method in 1 18 1. 

The complexity of the algorithm grows exponentially with 
the column weight. The complexity also depends greatly on 
how condition r is checked on a Tanner graph. However, for 
small column weights, say 3 or 4, and small to moderate 
code lengths, the algorithm is well handled by state-of-the- 
art computers. For example, the construction of a (1111,808) 
code which has girth 8, minimum distance at least 10 and 



which contains no (5, 3) trapping set given in Figure 1 b) takes 
less than 2 minutes on a 2.4 GHz computer. 

We continue this section by providing two examples of con- 
struction of column-weight-three codes whose Tanner graphs 
do not contain small trapping sets described in the TSO. 

Example 1: Let q — 53 and let Cg" ™"* denote the code ob- 
tained when t is defined as a condition that the corresponding 



Tanner graph of Cg llcw ' 1 has girth g > 



.<4 ncw) is a (530, 371) 
LDPC code with rate R = 0.7. Let Cg° ld) denote the (530, 
371) shortened array code (or integer lattice code described in 
Jl8)). Cg° ld ' ) is obtained by extensive computer search and has 
the maximum possible rate of R = 0.7. Denote by C^ 1Q the 
code obtained when r is such that the minimum distance of 
Cfi 10 is at least 10. C^ w is constructed by avoiding codewords 
of weight 6 and 8 in the Tanner graph (the TSO lists two 
possible codewords of weight 6 and five possible codewords 
of weight 8 for codes with g > 6). Ca w is a (795, 636) 
LDPC code with rate R = 0.8 and girth g = 6. The error 



performance of C, 



(new) ^(old) 



8 



and C^ 10 under the SPA with 
a maximum of 50 iterations over the AWGNC is shown in 
Figure [2] It can be seen that the error performance of Cg 11GW ^ 

One possible explanation for 



is better than that of C 



(old) 




Fig. 2. Performances of the codes given in Example^ over the AWGNC. 



subgraphs induced by the codeword of weight 6 while the 
minimum distance of Cg" cw ' is 10. Allowing the Tanner graph 
of Cfi 10 to have girth 6 but requiring that the minimum distance 
is at least 10 results in higher rate than the rate of C% ncw \ while 
maintaining the good error performance. This example clearly 
demonstrates that larger girth alone does not necessarily lead 
to better performance. We also remark that although minimum 
distance is used as the design parameter to construct C^ w with 
good error performance, in general this may not be sufficient 
to guarantee low error floors, since codes with high minimum 
distance may still contain trapping sets. 

Example 2: Let q = 19 2 = 361. The codes obtained when 
conditions t%, t 2 , T3 and T4 are imposed are denoted with C\, 
C 2 , C3 and C4, respectively. Conditions t\, t 2 , t 3 and r 4 are 
defined as 

• n: G has girth g > 10. 

• r 2 . G has girth g > 8; G does not contain the (5,3) 
trapping set of girth 8 shown in Figure |l|[b)| and G does 



not contain the (6,4) trapping set shown in Figure 1 c) 



T3: G has girth g > 8; G does not contain the (5,3) 
trapping set of girth 8 and an eight cycle in G can share 
2 variable nodes with at most one other eight cycle. 
T4: G has girth g > 6; G does not contain the (5,3) 



this observation is that the Tanner graph of Cg°' d ^ contains 



trapping set of girth 6 shown in Figure l a), G does not 
contain the (5,3) trapping set of girth 8; and an eight 
cycle in G can share 2 variable nodes with at most one 
other eight cycle. 

The Tanner graphs of these codes contain 361 check nodes. 
Cx, C 2 , C 3 and C 4 have lengths m = 2888, n 2 = 3249, n 3 = 
3610, n 4 = 3971 and rates R t = 0.63, R 2 = 0.67, R 3 = 
0.70, i?4 = 0.73. The error performance of C±, C 2 , C3 and 
C4 under the SPA with a maximum of 50 iterations over the 
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Fig. 3. Performances of the codes given in Example [5] over the AWGNC. 

AWGNC is shown in Figure [3] 

It can be seen that the conditions t\ , ti , T3 and T4 are 
successively weaker. Since stronger conditions usually lead 
to codes with lower rates, we can observe in this example 
that Ri < i? 2 < i?3 < R 4 . From the simulation results, 
we see no loss in the error performance of codes with 
weaker conditions. This example emphasizes the importance 
of properly identifying the trapping sets to be avoided in the 
Tanner graph since it is crucial for the rate and the error 
performance of the code. 

Remark: Conditions T3 and T4 permit an eight cycle in G 
to share 2 variable nodes with at most one other eight cycle. 
Consequently, (6, 4) trapping sets can be present in the Tanner 
graphs but their variable nodes can be involved in at most two 
eight cycles. Therefore many children of the (6, 4) trapping 
set are avoided (see [6] for more details). 

VI. Discussion and Conclusions 

We have introduced a class of structured LDPC codes with 
a wide range of rates and lengths. The code description is 
based on Latin squares, hence they can be explained both 
algebraically or combinatorially. Moreover, the description al- 
lows a code construction by progressively building the Tanner 
graph. The Tanner graph is built so that it does not contain a 
predefined set of trapping sets of iterative decoding algorithms. 
In this paper, we rely on the TSO - a database of trapping sets 
for the Gallager A/B algorithm on the BSC. Our conjecture 
is that trapping sets for other iterative decoding algorithms 
such as the SPA must contain trapping sets for the Gallager 
A/B algorithm. By eliminating trapping sets listed in the TSO, 
the codes have good error performance when decoded by 
other iterative decoding algorithms on the BSC or AWGNC. 
Although we could not provide enough experimental results 
for comparison with existing codes due to page limitations, 
to the best of our knowledge, our codes outperform the best 



known short length structured LDPC codes. Our current and 
future work includes identifying trapping sets for various 
decoding algorithms over the BSC and AWGNC, with the TSO 
as a starting point. 
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